Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Videos

نویسندگان

  • Hou-Ning Hu
  • Yen-Chen Lin
  • Ming-Yu Liu
  • Hsien-Tzu Cheng
  • Yung-Ju Chang
  • Min Sun
چکیده

Watching a 360 sports video requires a viewer to continuously select a viewing angle, either through a sequence of mouse clicks or head movements. To relieve the viewer from this “360 piloting” task, we propose “deep 360 pilot” – a deep learning-based agent for piloting through 360 sports videos automatically. At each frame, the agent observes a panoramic image and has the knowledge of previously selected viewing angles. The task of the agent is to shift the current viewing angle (i.e. action) to the next preferred one (i.e., goal). We propose to directly learn an online policy of the agent from data. Specifically, we leverage a state-of-the-art object detector to propose a few candidate objects of interest (yellow boxes in Fig. 1). Then, a recurrent neural network is used to select the main object (green dash boxes in Fig. 1). Given the main object and previously selected viewing angles, our method regresses a shift in viewing angle to move to the next one. We use the policy gradient technique to jointly train our pipeline, by minimizing: (1) a regression loss measuring the distance between the selected and ground truth viewing angles, (2) a smoothness loss encouraging smooth transition in viewing angle, and (3) maximizing an expected reward of focusing on a foreground object. To evaluate our method, we built a new 360-Sports video dataset consisting of five sports domains. We trained domain-specific agents and achieved the best performance on viewing angle selection accuracy and users’ preference compared to [53] and other baselines.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplementary Material: Deep 360 Pilot: Learning a Deep Agent for Piloting through 360◦ Sports Videos

r(lt(i), l gt t ) = { 1− ‖lt(i)−l gt t ‖2 η , if ‖lt(i)− l gt t ‖2 <= η −1, otherwise (1) where η equals the distance from the center of a viewing angle to the corner of its corresponding NFoV, i.e., √ 32.752 + 24.562 = 40.9 if we define NFOV as spanning a horizontal angle of 65.5◦ with a 4 : 3 aspect ratio. When lt == l gt t , the reward is 1, which is the maximum reward. When ‖lt(i) − l t ‖2 ...

متن کامل

A Deep Ranking Model for Spatio-Temporal Highlight Detection from a 360 Video

We address the problem of highlight detection from a 360◦ video by summarizing it both spatially and temporally. Given a long 360◦ video, we spatially select pleasantlylooking normal field-of-view (NFOV) segments from unlimited field of views (FOV) of the 360◦ video, and temporally summarize it into a concise and informative highlight as a selected subset of subshots. We propose a novel deep ra...

متن کامل

360° View Camera Based Visual Assistive Technology for Contextual Scene Information

360° View Camera Based Visual Assistive Technology for Contextual Scene Information Mazin Ali Supervising Professor: Dr. Ferat Sahin In this research project, a system is proposed to aid the visually impaired by providing partial contextual information of the surroundings using 360° view camera combined with deep learning is proposed. The system uses a 360° view camera with a mobile device to c...

متن کامل

HindSight: Enhancing Spatial Awareness by Sonifying Detected Objects in Real-Time 360-Degree Video

Our perception of our surrounding environment is limited by the constraints of human biology. The field of augmented perception asks how our sensory capabilities can be usefully extended through computational means. We argue that spatial awareness can be enhanced by exploiting recent advances in computer vision which make high-accuracy, real-time object detection feasible in everyday settings. ...

متن کامل

Spatiotemporal Rate Adaptive Tiled Scheme for 360 Sports Events

The recent rise of interest in Virtual Reality (VR) came with the availability of commodity commercial VR products, such as the Head Mounted Displays (HMD) created by Oculus and other vendors. One of the main applications of virtual reality that has been recently adopted is streaming sports events. For instance, the last olympics held in Rio De Janeiro was streamed over the Internet for users t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017